GPU Accelerated Radio Astronomy Signal Convolution
نویسندگان
چکیده
The increasing array size of radio astronomy interferometers is causing the associated computation to scale quadratically with the number of array signals. Consequently, efficient usage of alternate processing architectures should be explored in order to meet this computational challenge. Affordable parallel processors have been made available to the general scientific community in the form of the commodity graphics card. This work investigates the use of the Graphics Processing Unit (GPU) in the parallelisation of the combined conjugate multiply and accumulation stage of a correlator for a radio astronomy array. Using NVIDIA’s Compute Unified Device Architecture, our testing shows processing speeds from one to two orders of magnitude faster than a Central Processing Unit (CPU) approach.
منابع مشابه
GPU Accelerated Source Extraction in Radio Astronomy: A CUDA Implementation
.................................................................................................................................................. ii
متن کاملImplementation of Digital Signal Processing Algorithm in General Purpose Graphics Processing Unit (GPGPU)
In this paper, we have proposed sequential and parallel matrix and matrix-vector multiplication in compute unified device architecture (CUDA) libraries. We show the process of a class of algorithms parallelization which are used in digital signal processing. We present this approach on the instance of the Linear Convolution, Circular Convolution, and Least Mean Square (LMS) algorithm. We propos...
متن کاملScaling Radio Astronomy Signal Correlation on Heterogeneous Supercomputers Using Various Data Distribution Methodologies
Next generation radio telescopes will require orders of magnitude more computing power to provide a view of the universe with greater sensitivity. In the initial stages of the signal processing flow of a radio telescope, signal correlation is one of the largest challenges in terms of handling huge data throughput and intensive computations. We implemented a GPU cluster based software correlator...
متن کاملConvolution of large 3D images on GPU and its decomposition
In this article, we propose a method for computing convolution of large 3D images. The convolution is performed in a frequency domain using a convolution theorem. The algorithm is accelerated on a graphic card by means of the CUDA parallel computing model. Convolution is decomposed in a frequency domain using the decimation in frequency algorithm. We pay attention to keeping our approach effici...
متن کاملAutomatic Mapping of Real Time Radio Astronomy Signal Processing Pipelines onto Heterogeneous Clusters
Automatic Mapping of Real Time Radio Astronomy Signal Processing Pipelines onto Heterogeneous Clusters by Terry Esther Filiba Doctor of Philosophy in Engineering – Electrical Engineering and Computer Sciences University of California, Berkeley Professor John Wawrzynek, Co-chair Daniel Werthimer, Co-chair Traditional radio astronomy instrumentation relies on custom built designs, specialized for...
متن کامل